منابع مشابه
Evolufion from TD-SCDMA to TD-LTE
This paper gives the brief introduction to key features of TD-SCDMA and TD-LTE (TDD Long Term Evolution). For the TD-SCDMA system, key technologies such as joint detection, smart antenna and synchronization are described and analyzed. For the TD-LTE system, important technologies such as OFDM, MIMO, uplink timing control, inter-cell interference coordination are applied. It is shown that the TD...
متن کاملA unified view of TD algorithms, introducing Full-gradient TD and Equi-gradient descent TD
This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ), iLSTD, residual-gradient TD. It is asserted that they all consist in minimizing a gradient function and differ by the form of this function and their means of minimizing it. Two new schemes are introduced in that ...
متن کاملTD Networks
We introduce a generalization of temporal-difference (TD) learning to networks of interrelated predictions. Rather than relating a single prediction to itself at a later time, as in conventional TD methods, a TD network relates each prediction in a set of predictions to other predictions in the set at a later time. TD networks can represent and apply TD learning to a much wider class of predict...
متن کاملGeneralized TD Learning
Since the invention of temporal difference (TD) learning (Sutton, 1988), many new algorithms for model-free policy evaluation have been proposed. Although they have brought much progress in practical applications of reinforcement learning (RL), there still remain fundamental problems concerning statistical properties of the value function estimation. To solve these problems, we introduce a new ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal for Transdisciplinary Research in Southern Africa
سال: 2007
ISSN: 2415-2005,1817-4434
DOI: 10.4102/td.v3i2.326